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Abstract 

Background: To date no network meta-analysis (NMA) has accounted for baseline variations in viral load when 
assessing the relative efficacy of interventions for chronic hepatitis B (CHB). We undertook baseline-adjusted and 
unadjusted analyses using the same data to explore the impact of baseline viral load (BVL) on CHB treatment response. 

Methods: We searched Embase, Medline, Medline in Process and the Cochrane CENTRAL databases for randomised 
clinical trials (RCTs) of monotherapy interventions at licensed doses for use in CHB. Search strategies comprised CHB 
disease and drug terms (a combination of controlled vocabulary and free text terms) and also a bespoke RCT filter. 
The NMA was undertaken in WinBUGs using fixed and random effects methods, using data obtained from a systematic 
review. Individual patient data (IPD) from an entecavir clinical trial were used to quantify the impact of different 
baseline characteristics (in particular undetectable viral load (UVL) at 1 year) on relative treatment effect. Study level 
mean baseline values from all identified studies were used. Results were generated for UVL and presented as relative 
risks (RRs) and 95% credible intervals (Cris) using entecavir as reference treatment. 

Results: Overall, for all eight relevant interventions we identified 3,000 abstracts. Following full text review a total of 35 
(including the contents of six clinical study reports) met the inclusion critera; 19 were in hepatitis B e antigen 
(HBeAg)-positive patients and 14 of the 19 contained outcome information of relevance to the NMA. 
Entecavir and tenofovir studies had heterogeneous patient populations in terms of BVL (mean values 9.29 and 8.65 
logio copies/ml respectively). After adjusting UVL for BVL using an informative prior based on the IPD analysis, the 
difference between entecavir and tenofovir was not statistically significant (RR 1.27, 95% CrI 0.96 to 1.47 - fixed effects). 
A similar conclusion was found in all sensitivity analyses. Adjusted tenofovir results were more consistent with observed 
clinical trial response rates. 

Conclusions: This study demonstrates the importance of adjusting for BVL when assessing the relative efficacy of CHB 
interventions in achieving UVL. This has implications for both clinical and economic decision making. 
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Background 

Chronic hepatitis B (CHB) is responsible for about 600,000 
deaths worldwide per year, from end-stage liver disease 
or hepatocellular carcinoma (HCC) [1]. An estimated 350 
to 400 million people have CHB [2], of whom 15 to 40% 
will eventually experience serious complications (hepatic 
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cirrhosis, hepatic decompensation or HCC) [3]. Develop- 
ment of complications is associated with persistent replica- 
tion of the hepatitis B virus (HBV) [2]; hence, an important 
goal of CHB treatment is long-term suppression of HBV 
replication to undetectable levels, as measured by serum 
HBV DNA (virologic response) [2,4]. Normalization of 
serum alanine transaminase (ALT), loss of hepatitis B e 
antigen (HBeAg) and improvement in liver histology are 
other recognized measures of CHB treatment efficacy. 
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Current European clinical guidelines recommend the 
following treatment options for CHB: entecavir, lamivudine, 
telbivudine, adefovir dipivoxil, tenofovir dipivoxil fumarate, 
peginterferon-alfa-2a, interferon-alfa-2a, and interferon- 
alfa-2b [2]. Information on their relative efficacy is import- 
ant in order for healthcare professionals and payers to make 
evidence-based decisions on which treatments to prescribe. 
Because head-to-head comparisons of competing CHB 
treatment options via randomised clinical trials (RCTs) 
are not available for all comparators in HBeAg antigen- 
positive or -negative CHB, indirect evidence in the form 
of network meta-analyses (NMAs) has been used to es- 
timate relative efficacy. NMAs extend conventional, pair- 
wise meta-analysis, and are based on the principle that 
within trial estimates of relative treatment effects can be 
added and subtracted [5,6]. 

An important assumption with NMAs is that the studies 
used are sufficiently similar in terms of relative treatment 
effect modifiers [7] - that is, study-level factors that may 
influence the size of the treatment effect seen with a par- 
ticular pair-wise intervention. These include patient char- 
acteristics, outcomes measured, and study design. Thus, 
to ensure a fair comparison of interventions, it is essential 
to control for differences between studies in terms of 
potential relative treatment effect modifiers. In particular, 
baseline differences in patient characteristics between differ- 
ent trials may distort between-trial comparisons if appropri- 
ate adjustments are not made. 

In CHB, response to treatment varies according to the 
outcome of interest, the agent used, and the patients 
HBeAg status [2]. Patient/disease characteristics that have 
been shown to predict response to treatment in at least 
some categories include baseline viral load, serum ALT 
level, HBV genotype, and activity score on liver biopsy 
[2,4]. Ali and colleagues [8] analysed data from 1,353 pa- 
tients in two RCTs of entecavir and found that higher 
baseline viral load was associated with reduced odds of re- 
sponse to treatment: when baseline viral load (by PCR) 
was treated as a continuous variable, the odds of achieving 
a response were reduced by a factor of 0.38 (62%) for 
every one unit increase in logio PCR above a threshold of 
400 copies/ml. 

Given the absence of head-to-head RCTs for all in- 
terventions, the objective of the current study was to 
generate estimates of relative efficacy of achieving un- 
detectable viral load (UVL) that take into account the 
potential of baseline viral load to act as a treatment ef- 
fect modifier, in order to provide like-for-like compari- 
sons between treatments for CHB that take into account 
the heterogeneity in baseline viral load across patient 
populations in different trials. 

In order to compare the results with previously pub- 
lished NMAs, as well as demonstrate the implications 
for clinical and reimbursement decisions of using such 



estimates, we also generated unadjusted relative efficacy 
estimates using similar methodologies to those used in 
these previous analyses. 

Methods 

We carried out adjusted and unadjusted analyses, using 
the same trial data for each, to explore the impact of 
baseline viral load on treatment response at 1 year. The 
interventions analysed at licensed doses were interferon 
alfa, peginterferon alfa-2a/2b, lamivudine, adefovir dipi- 
voxil, entecavir, tenofovir, telbivudine and also placebo. 
Trials for inclusion in the NMAs were identified through 
a systematic review of the literature. 

The efficacy endpoints analysed in the unadjusted ana- 
lysis were ALT normalization, histological improvement, 
HBeAg seroconversion and achievement of UVL at 1 year. 
Since Ali and colleagues [8] only generated results for one 
endpoint (achievement of UVL at 1 year) the adjusted 
analysis was necessarily restricted to this endpoint/time- 
point. In all analyses, UVL was defined as reduction in 
HBV DNA level (by PCR assay) below the trial specific 
lower level of quantification (LLOQ). 

Systematic review 

We carried out a systematic review of RCTs of the inter- 
ventions listed above. The inclusion criteria were RCTs 
(phase II or III) of monotherapy interventions at licensed 
dose, adults with CHB, reporting any of the endpoints of 
interest, and published in English. Papers (full or other- 
wise) reporting interim results and studies using the inter- 
ventions of interest at non-licensed doses were excluded. 

Searches were carried out in the Embase, Medline, 
Medline in Process and Cochrane CENTRAL databases 
between March and April 2011. No restriction was placed 
on the earliest date of publication and all databases were 
searched from date of inception. Search strategies com- 
prised CHB disease and drug terms (a combination of 
controlled vocabulary and free text terms), and also a be- 
spoke RCT filter. A search was also made for abstracts 
from the European Association for the Study of the Liver 
and the American Association for the Study of Liver Dis- 
eases 2010 and 2011 annual conferences. Search syntax 
for all databases are available on request from the authors. 
The search strategy used to search the Embase database is 
presented in Additional file 1. 

The studies were separated into four clinically distinct pa- 
tient groups: treatment-naive HBeAg-positive or -negative, 
lamivudine refractory and other'. Abstract screening was 
performed by two authors and included in the full paper re- 
view if one reviewer thought it relevant. Formal full paper 
review was undertaken by two reviewers against the pre- 
specified inclusion criteria with a third acting as mediator 
in situations of disagreement. Three authors independently 
extracted study characteristics and the outcome data 
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required for the NMA using a standard form. Discrepancies 
were resolved by one of two other authors. Outcome data 
from weeks 48 and 52 were assumed to refer to 1 year. A 
risk of bias assessment was carried out using Cochrane 
methodology for those RCTs reported as full papers [9]. No 
formal protocol was created for this review. 

Statistical analyses 

Given the lack of head-to-head trial evidence estimating 
the relative efficacy of all licensed interventions, we used 
an NMA approach to synthesise the evidence. In the NMA 
methodology the difference in effect between treatments A 
and B is equal to the difference in effects between treat- 
ments A and C, and B and C. The analysis can be expanded 
to more complex networks of evidence, and can produce 
estimates of both mean effect and uncertainty [10]. 

Fixed effect models were used in the unadjusted analysis. 
For the adjusted analyses we used both fixed and random 
effects models, and final model choice in all analyses was 
based on deviance information criteria (DIG) [11]. 

Ghoice of prior distribution for parameters in NMA 
models is an important consideration, especially in the 
presence of sparse networks of evidence. Uninformative 
priors were used in the unadjusted analyses for all model 
parameters. In all covariate-adjusted analyses the results 
from Ali and colleagues [8] were used to inform the 
prior distribution on the regression coefficient associated 
with baseline viral load. Variations in baseline viral load 
in each study arm (where there were differences) were 
incorporated into the adjusted analyses via the use of the 
average baseline HBV DNA value across arms within a 
given RGT. 

Baseline viral load was assumed to modif)^ treatment 
effects relative to entecavir (0.5 mg), which was also used 
as the baseline against which all relative efficacy estimates 
were calculated. To make the results easier to interpret by 
a non-statistical audience, we represented relative efficacy 
as a relative risk (RR) of response instead of the more nat- 
ural odds ratio. We reported the mean of the posterior 
probability distribution as well as the 95% credible interval 
(GrI) for each RR. When the 95% GrI did not include the 
value one, the RR was considered significantly different to 
that for entecavir. 

In order to compare the results of the analyses with 
the input data, as well as presenting the output in an in- 
tuitive manner, we also generated the absolute predicted 
posterior probabilities of response for each clinical out- 
come and treatment combination. In the adjusted NMA 
we also undertook a range of sensitivity analyses whereby 
in addition to the use of fixed and random effects models, 
the impact of adding or removing individual studies due 
to heterogeneity was assessed. Gaterpillar, density and 
Brooks-Gelman-Rubin plots were examined in all analyses 
to ensure model convergence. 



The analyses were conducted in WinBUGS Version 1.4 
(MRG Biostatistics Unit, Gambridge, UK) [12] using Bayesian 
Markov Ghain Monte-Garlo Gibbs sampling methods. 

Results 

Search results and summary of studies 

Our search of clinical databases identified 3,000 abstracts; 
179 articles, including clinical study reports (GSRs), were 
ordered/requested for review, of which 35 (six GSRs) met 
the inclusion criteria [13-48]. The contents of five of the 
GSRs had been reported in peer reviewed publications 
already captured by the search and hence the published 
data were used [18,26,37,42,44]. One GSR (BMS study 
AI463023) [13] and the Summary of Product Gharacteris- 
tics for telbivudine [22] were included in the review. In 
total, the review identified 29 unique trials. Of these, 19 
contained information in HBeAg-positive patients, and 14 
of the 19 reported enough information to warrant inclu- 
sion in a NMA, and 13 reported information on UVL at 
that timepoint [13-15,18,20-24,26,28,30,32,48]. 

The study selection process is presented as a PRISMA 
diagram in Figure 1. The PRISMA 2009 checklist is repor- 
tedin Additional file 2. Study characteristics and reported 
UVL at 1 year (defined as either 48 or 52 weeks) are shown 
in Table 1. The assessment of study quality undertaken as 
part of the systematic review is reported in Additional file 
1: Table SI. Studies identified by the systematic review used 
a range of LLOQ values from 1,000 to 200 copies/ml. 

Unadjusted network meta-analysis 

The network of evidence used to generate all results is 
presented in Figure 2. The results of the fixed effects 
analysis are presented as relative risks in Table 2 and ab- 
solute probabilities of response in Additional file 1: Table 
S2. There was only one instance where a treatment per- 
formed significantly better than entecavir: the RR for 
tenofovir achieving UVL was 1.43 (95% GrI 1.30 to 1.54). 
With the exception of telbivudine which demonstrated no 
statistically significant difference to entecavir (RR 0.88, 
95% GrI 0.76 to 1.00) all other interventions performed 
significantly less well than entecavir. 

Adjusted network meta-analysis 

The primary adjusted analysis of achieving UVL at 1 year, 
when accounting for baseline viral load, was undertaken 
using materials available in the public domain (the "base 
case"). Thus, the material extracted from the GSR was 
excluded. In addition, the baseline rates for two studies 
were very different to the remainder in that they were 
assessed using a different assay with very different LLOQ 
definitions suggesting that baseline data were collected 
in a different manner to all other studies [16,24]. These 
studies were also excluded from the base case analysis. 
One study, TBVIG, reported median rather than mean 
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3000 records identified through database searching: 
1317EMBASE 

770 MEDLINE® and MEDLINE® In-Process 
913 CENTRAL 



2113 records 293 conference abstracts identified 

following 80 from EASE 2010 and 201 1 213 

de-duplication from AASLD 2009 and 2010 



2406 records screened for eligibility 
2113 full papers 
293 conference abstracts 




2224 excluded at abstract and title review 
1934 full papers 
290 conference abstracts 


► 



179 full text articles/ Clinical Study Reports 
screened for eligibility 






35 distinctive items inc] 
synthesis 


uded in qualitative 



149 full text articles excluded 
9 not appropriate patient group 
49 not full paper publication 

4 no appropriate endpoints reported 
45 not appropriate drug in trial 

7 not English Language 

21 not randomised controlled trial 

9 follow up/further analyses 

5 previously reported trials 



29 unique trials included 







14 trials in HBeAg-positive patients 
considered for inclusion in the NMA 



10 studies excluded for not being 
performed in HBeAg positive CHB 
5 studies excluded for not reporting 
endpoint data at one year 



Figure 1 PRISMA diagram of studies included in the systematic review. AASLD, American Association for tine Study of Liver Diseases; CHB, 
clironic liepatitis B; EASL, European Association for tine Study of tine Liver; HBeAg, liepatitis B e antigen; NMA, networl< meta-analysis. 



baseline viral load and was hence also excluded from the 
base case-adjusted analysis [30] . Information on this study 
is provided in Table 1. Data from the ten studies that re- 
ported baseline viral load were used in the adjusted ana- 
lyses [14,15,18,20-23,26-28,31,48]. 

The results are presented as relative efficacy estimates in 
the second column of Table 3 and as absolute probabilities 
of UVL at 1 year in Figure 3 (fixed effects) and Figure 4 
(random effects). 

The relative risk estimates produced by the fixed and ran- 
dom effects base case analyses were very similar. In particu- 
lar, entecavir produced significantly increased RRs of UVL 
at 1 year compared with all interventions except telbivudine 
and tenofovir, for which the likelihood was similar. In con- 
trast to the unadjusted analysis, the relative efficacy of ente- 
cavir and tenofovir for achieving UVL was not significantly 
different. Thus, baseline viral load is a significant moderator 
of the effects of monotherapies for CHB. The DIG estimate 
for the fixed effect model was lower than that for the ran- 
dom effects model and was thus the preferred approach. 

Adjusted network meta-analysis: sensitivity analyses 
Exclusion of data from one adefovir study 

The reported 1 year UVL rate for adefovir patients 
as reported by the 018 Study Group is approximately 



two to three times higher than reported for adefovir 
in all other studies (Table 1). In contrast, the absolute 
response rate for patients receiving telbivudine re- 
ported by this study was in line with that observed in 
the other studies. The impact of removing this study 
is presented in column three of Table 3, and in 
Figures 3 and 4. 

The DIG statistics for both fixed and random effects 
models were similar, with the random effects analysis repre- 
senting the best fitting model. While the results overall are 
similar to those in the base case analyses, the greatest im- 
pact is observed in the tenofovir results, with a relative risk 
value of 1.08 (95% GrI 0.22 to 1.52). Of note, the derived 
absolute response probabilities for entecavir and tenofovir 
in this scenario were 65.9% and 71.4%, respectively (ran- 
dom effects model). The corresponding values in the key 
regulatory trials were 66.7% and 76.5%, respectively. 

Exclusion of data from the 018 Study Group, and inclusion 
of data from AI463023 and TBVIG 

The systematic review identified two additional studies 
which contained information of potential interest: as yet 
unpublished data from BMS study AI463023 and median 
baseline values from the TBVIG study [13,29]. When 
these data were included, but the data from 018 remained 



Table 1 Study characteristics and 1-year outcomes of studies included in the network meta-analysis (HBeAg-positive patients only) 



Source 


Treatment 
duration 


Study design 


Number 
of patients 


Treatment 


Age 
(years) 


Male 

10/ \ 


Endpoint 
timepoint 


Method used to 
measure HBV DNA 


LLOQ 


Baseline viral load 
(log-io copies/ml) 


UndetectabI 
uiNA at 1 y€ 


018 Study Group [13] 


52 weeks 


Randomised, 


45 


TBV 


34 


78 


52 weeks 


Amplicor PGR 


300 copies/ml 


9.57 


60 






controlled, open label 




600 mg 








Assay (Roche) 














44 


ADV 1 0 mn 

/ALy V 1 W 1 1 iKJ 


30 


91 








9.98 


40.9 


ADV 437 Study 


48 weeks 


Randomised, 


167 


Placebo 


37 


71 


48 weeks 


Amplicor PGR 


400 copies/ml 


8.12 


0 


Group [14] 




single blind 


171 


ADV 1 0 mg 


34 


76 




Assay (Roche) 




8.25 


21.1 


AHLSG [15] 


52 weeks 


Randomised, 


72 


Placebo 


29 


72 


52 weeks 


Solution hybridising 


1.6 pg/ml 


1.85 


NR 






double blind 


143 


LAM 


31 


74 




assay (Abbott) 




1.8 


NR 










100 mg 
















AI463023 [12] 


96 weeks 


Phase 3 randomised. 


225 


EW 0.5 mg 


- 


- 


52 weeks 


PGR assay (company 


300 copies/ml 


8.80 


73.8 






double blind 


221 


LAM 


- 


- 




unspecified) 




8.70 


37.6 










100 mg 
















BeHoLD_l [1 7] 


60 weeks 


Phase 3 randomised. 


354 


EW 0.5 mg 


35 


77 


48 weeks 


Amplicor PGR 


300 copies/ml 


9.62 


66.7 






double blind 


355 


LAM 


35 


74 




Assay (Roche) 




9.69 


36.3 










100 mg 
















Globe study group 


NR 


Phase 3 randomised. 


463 


LAM 


33 


76 


52 weeks 


Amplicor PGR 


300 copies/ml 


9.50 


40.4 


[19-21] 




double blind 




100 mg 








Assay (Roche) 














458 


TBV 


32 


73 








9.50 


60 










600 mg 
















Hou [22] 


1 04 weeks 


Phase 3 randomised. 


147 


TBV 


28 


80 


52 weeks 


Amplicor PGR 


300 copies/ml 


9.30 


66.7 






double blind 




600 mg 








Assay (Roche) 














143 


LAM 


29 


75 








9.70 


37.8 










100 mg 
















ILSG [23] 


52 weeks 


Randomised, partially 


82 


LAM 


30 


71 


52 weeks 


Solution hybridising 


1.6 pg/ml 


2.04 


60 






double blind 




100 mg 








assay (Abbott) 














uy 


IFNA 


DZ. 


9.1 

O 1 








1 ./ o 


7Q 1 


Lau [47] 


72 weeks 


Phase 3 randomised. 


271 


PeglFNA 


32.5 


79 


48 weeks 


Amplicor PGR 


400 copies/ml 


9.90 


25.1 






Hni il^jp l^linH 


272 


LAM 


31.6 


79 




/xjoay Icy 




10.10 


39.7 










1 nn mn 

1 uu 1 1 ly 
















Leung [25] 


Minimum 52 weeks 


Phase 3 randomised. 


33 


EW 0.5 mg 


37 


61 


48 weeks 


Amplicor PGR 


300 copies/ml 


10.30 


57.6 






nnpn lahpl 


32 


ADV 10 mg 


32 


66 




A<^<^?i\/ (Rnrhpl 




9.88 


18.8 


Marcellin [26] 


48 weeks 


Phase 3 randomised. 


176 


TDF 


34 


68 


48 weeks 


Gobas Taq-Man PGR 


169 copies/ml 


8.64 


76.1 






double blind 




300 mg 








Assay (Roche) 














90 


ADV 10 mg 


34 


71 








8.88 


13.3 


Ren [27] 


48 weeks 


Randomised 


21 


LAM 


34 


52 


48 weeks 


PGR assay (company 


Unspecified 


8.49 


38 










100 mg 








unspecified 














21 


EW 0.5 mg 


31 


57 








8.52 


71.4 



Table 1 Study characteristics and 1-year outcomes of studies included in the network meta-analysis (HBeAg-positive patients only) (Continued) 



TBVIG [29] 



USLIG [30] 



52 weeks 



1 weeks 



Phase 2 randomised, 
double blind 



Prospective, 
randomised, 
double blind 



19 

22 

71 
66 



LMV 
100 mg 

TBV 
600 mg 

Placebo 

LAM 
100 mg 



34 

40 

38 
40 



74 52 weeks 
82 

80 52 weeks 



Amplicor PGR 200 copies/ml 
Assay (Roche) 



Unspecified 



Unspecified 



N/R 

N/R 

5.70 
10.20 



31.6 

61.4 

15.9 
44.4 



The systematic review identified 21 studies reporting hepatitis B e antigen (HBeAg)-positive patients; this table only shows the 14 studies Included In the network meta-analysis. The remaining 7 studies did not report 
outcomes at 1 year and so were not Included In the network meta-analysis. ^Patient numbers for overall population provided, baseline viral load and hepatitis B virus (HBV) DNA reported as HBeAg-posltlve/HBeAg- 
negatlve, outcomes for HBeAg-posltlve patients only. ADV, adeforvir; ETV, entecavir; IFNA, interferon-alfa; LAM, lamivudine; LLOQ, lower level of quantification; NR, not reported; PeglFNA, pegylated interferon-alfa 2a; 
TBV, telblvudlne; TDF, tenofovlr. 
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Peg Interferon 



Figure 2 Evidence networlcs of studies used to generate unadjusted results for the undetectable viral load endpoint. 



excluded, the corresponding results from this analysis are 
presented in column four of Table 3, and in Figures 3 and 4. 

The random effects analysis generated the lowest DIG 
and is therefore the preferred model There was no sig- 
nificant difference between the relative efficacy of ente- 
cavir, telbivudine and tenofovir, but entecavir performed 
significantly better than all other interventions. These 
results are in contrast to the unadjusted results. The 
absolute probabilities derived using a random effects 
model for tenofovir, telbivudine and entecavir were simi- 
lar to those observed in the landmark RCTs. 



Discussion 

NMA can be used to generate relative efficacy estimates 
of competing treatments in situations where more than 
two treatment options are available and direct head-to- 
head evidence from RCTs does not exist for all compara- 
tors. The NMA approach allows all relevant evidence to 
be considered and addresses research questions in the 
absence of direct comparative evidence, improving the 
precision of estimates by combining direct and indirect 
evidence. 



Table 2 Unadjusted efficacy estimates relative to 


treatment with entecavir 




Fixed effects model 


Entecavir 0.5 mg 


1.00 (Baseline) 


Adefovir 10 mg 


0.64 (0.42, 0.87) 


Lamivudine 100 mg 


0.54 (0.46, 0.63) 


Placebo 


0.10 (0.04, 0.19) 


Telbivudine 600 mg 


0.88 (0.76, 1 .00) 


Tenofovir 245 mg 


1.43 (1.30, 1.54) 


Interferon alfa 


0.21 (0.09, 0.38) 


Peginterferon alfa-2a/2b 


0.34 (0.23, 0.46) 



Results are shown as relative risk (95% credible interval). A relative risk <1 
should be interpreted as a given treatment being less efficacious (that is, 
worse) than entecavir, and a value >1 being more efficacious (that is, better). 



One of the key assumptions underpinning this method 
is that the studies included in the analysis are homoge- 
neous (that is, the trials are sufficiently similar on study 
and patient characteristics). The similarity assumption is 
violated if one or more study-level covariates act as 
modifiers of the relative treatment effects and their dis- 
tribution is not balanced across the studies being com- 
pared [49,50]. In this case, NMA may be affected by 
confounding bias, unless one explicitly controls for these 
covariates in the statistical analyses. 

Controlling for covariates is particularly important in 
cases where response to treatment is defined in terms of 
post-treatment level of a measure, and when that base- 
line level of this measure is known to vary across studies. 
If one study recruits patients with worse levels of a vari- 
able that is known to modif)^ the relative impact of treat- 
ment, then the level of response achieved is likely to be 
smaller compared with another study which primarily 
includes patients with better baseline levels, other things 
being equal. 

The motivation for our work was the belief that such 
baseline covariate imbalances had occurred for patients 
recruited into studies looking at interventions for CHB. 
In particular, it was noted that there were differences in 
mean baseline viral load (expressed in terms of logio 
copies/ml when measured using the PGR assay) with 
values for entecavir and tenofovir differing by approxi- 
mately 1 logio copies/ml (Table 1). We hypothesised that 
failure to account for these differences in previous ana- 
lyses may have led to biased estimates of relative efficacy. 

The work contained in this paper supports this hy- 
pothesis. When no adjustment was made to account for 
differences in baseline viral load among trials, tenofovir 
was shown to be significantly better than entecavir in 
terms of achieving UVL at 1 year (fixed effects RR 1.43, 
95% CrI 1.30 to 1.54). However, when we accounted for 
the impact of baseline viral load the difference between 
the two treatments was not significant (fixed effects RR 
1.27, 95% CrI 0.96 to 1.47; random effects RR 1.21, 95% 
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Table 3 Adjusted relative risk estimates for virologic response, expressed as relative risk of achieving undetectable 
viral load 



Intervention 


Base case 


Base case without 
018 study group 


Base case without 018 study group; 
with AI4463023 + TBVIG 


Fixed effects analyses 








Entecavir 0.5 mg 


1.00 (Baseline) 


1.00 (Baseline) 


1.00 (Baseline) 


Adefovir 10 mg 


0.33 (0.14, 0.62) 


0.23 (0.05, 0.56) 


0.33 (0.10, 0.70) 


Lamivudine 100 mg 


0.38 (0.22, 0.58) 


0.38 (0.22, 0.58) 


0.52 (0.41,0.64) 


Placebo 


0.01 (0.00, 0.03) 


0.00 (0.00, 0.02) 


0.01 (0.00, 0.03) 


Telbivudine 600 mg 


0.67 (0.43, 0.92) 


0.67 (0.44, 0.92) 


0.86 (0.71, 1.01) 


Tenofovir 245 mg 


1 .27 (0.96, 1 .47) 


1.12 (0.61, 1.43) 


1 .25 (0.84, 1 .48) 


Interferon alpha 


0.14(0.05, 0.29) 


0.13 (0.05, 0.29) 


0.20 (0.08, 0.37) 


Peginterferon alpha-2a/2b 


0.23 (0.11,0.39) 


0.22 (0.11, 0.39) 


0.32 (0.21,0.46) 


Residual deviance 


19.48 


17.10 


21.54 


Die 


35.56 


32.14 


39.15 


Random effects analyses 








Entecavir 0.5 mg 


1.00 (Baseline) 


1.00 (Baseline) 


1.00 (Baseline) 


Adefovir 10 mg 


0.33 (0.07, 0.82) 


0.27 (0.03, 0.95) 


0.29 (0.04, 0.78) 


Lamivudine 100 mg 


0.37 (0.09, 0.84) 


0.42 (0.07, 1 .09) 


0.51 (0.23,0.86) 


Placebo 


0.01 (0.00, 0.06) 


0.01 (0.00, 0.07) 


0.01 (0.00, 0.04) 


Telbivudine 600 mg 


0.64 (0.18, 1.19) 


0.72 (0.12, 1.39) 


0.86 (0.41, 1.27) 


Tenofovir 245 mg 


1.21 (0.48, 1.51) 


1 .08 (0.22, 1 .52) 


1.15 (0.39, 1.50) 


Interferon alpha 


0.16(0.01,0.67) 


0.20 (0.01, 0.94) 


0.22 (0.03, 0.67) 


Peginterferon alpha-2a/2b 


0.25 (0.03, 0.88) 


0.30 (0.02, 1.13) 


0.34 (0.07, 0.88) 


Residual deviance 


18.23 


16.12 


19.78 


Die 


35.86 


32.10 


39.07 



Results are shown as relative risk (95% credible Interval). "Base case" refers to the adjusted analysis undertaken using the 10 studies listed in the document 
containing appropriate information. As with the unadjusted analyses, a relative risk <1 should be interpreted as a given treatment being less efficacious (that is, 
worse) than entecavir and a value >1 being more efficacious (that is, better). DIC, deviance information criteria. 



CrI 0.48 to 1.51). The fixed effects adjusted model best fit- 
ted the underlying data, although the difference was minor 
(fixed effects DIC, 35.56; random effects DIC, 35.86). 

Sensitivity analyses highlighted that the relative effi- 
cacy of tenofovir versus entecavir was contingent on the 
choice of studies included in the meta-analysis, and in 
particular whether or not data reported by one study 
group [13] were used. When these data were excluded, 
there is no significant difference between the two inter- 
ventions (RR 1.08, 95% CrI 0.22 to 1.52). A subsequent 
sensitivity analysis, whereby this study was removed but 
two other studies were included (AI463023 and TBVIG), 
generated similar non-significant results (RR 1.15, 95% 
CrI 0.39 to 1.50). In both sensitivity analyses the most 
appropriate model, based on DIC, consisted of random 
as opposed to fixed effects approaches. Close examin- 
ation of the published paper [14] has identified no rea- 
son why this result should occur, and so there may be 
some other form of study level heterogeneity as yet un- 
accounted for that is influencing the results. 



Our paper is the first to generate baseline viral load 
adjusted and unadjusted NMA results using data from 
the same set of studies, and the results from the un- 
adjusted analyses are very similar to those generated by 
other research groups [51,52]. Accepting that NMA is 
based on relative efficacy, the results from all three un- 
adjusted analyses for UVL appear to be at odds with 
those provided by the clinical trials included in the NMA. 
The systematic review identified one study of tenofovir 
[27] and the observed response rate was 76%. The corre- 
sponding value arising from our NMA was 93.2% (95% 
CrI 85.6% to 97.6%). Similar values were generated by two 
other research groups [51,52]. One other NMA has been 
recently published [53]. This analysis, however, contains a 
number of methodological flaws, the most notable being 
the pooling of data from HBeAg-positive and -negative in- 
dividuals. We have therefore not extracted results from 
this paper for the purposes of discussion. 

In contrast, with the exception of placebo and interferon- 
based therapies, the Cris for the values derived in the 
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Figure 3 Absolute probability of undetectable viral load at 1 year (fixed effects). Basecase refers to the adjusted analysis undertaken 
using the ten studies listed in the document containing appropriate information. 



adjusted analyses all contain the observed trial values, and 
the RR estimates are close to the trial values once the 018 
Study Group data are removed (Figures 3 and 4). Hence, 
we would argue that the adjusted results are of greater clin- 
ical relevance than the unadjusted results. 



Generating like-for-like' estimates of relative efficacy 
by controlling for covariates believed to be modifiers of 
relative treatment effects is not just of clinical interest 
but is essential for the purposes of reimbursement deci- 
sions. Such estimates are used by agencies such as the 



90% 



80% 



^ 60% 



c 50% 
o 



fl, 40% 

3 
O 

(0 

g 30% 

20% 
10% 



■ Key trial data 
O Unadjusted 
X Basecase 



O Basecase without 018 study group 
A Basecase without 018 study group; 
with AI4463023 + TBVIG 



6^ 



it "l 



Tenofovir 
245mg 



Entecavir 
0.5mg 



Telbivudine Lamivudine 
100mg 



Adefovir 
10mg 



Peginterferon Interferon 
alpha-2a/2b alpha 



Figure 4 Absolute probability of undetectable viral load at 1 year (random effects). Basecase refers to the adjusted analysis undertaken 
using the ten studies listed in the document containing appropriate information. 
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National Institute for Health and Care Excellence in their 
appraisal processes when assessing the clinical efficacy in a 
given disease area [54]. In addition, such values are also 
used in economic models to evaluate the cost-effectiveness 
of interventions. A number of such models have been de- 
veloped in CHB [55-59], of which one [57] used the results 
from their unadjusted analysis directly as model inputs. 
Another [59] used UVL as a surrogate variable for risk of 
cirrhosis using information from the REVEAL-HBV study 
[60], which quantified the relationship between HBV DNA 
and the likelihood of being diagnosed with cirrhosis. Over- 
estimation of virologic response would thus correspond to 
underestimation of the likelihood of cirrhosis, which has 
been identified as a key driver of cost-effectiveness. 

Despite the review finding a decent number of studies 
overall, as can be seen from Figure 2, the presence of a 
large number of treatment options means that the ma- 
jority of the branches in the network are informed by 
the findings of a single study. This increases the uncer- 
tainty surrounding all results and means that baseline 
imbalances in other potential treatment effect modifiers 
may have influenced the results. 

Further work is needed to complement the work con- 
tained in this paper in connection with the achievement 
of UVL at 1 year in order to explore the impact of other 
potentially clinically relevant covariates on the relative 
effects of comparators and the probability of achieving 
UVL. Exploring the impact of other areas of potential 
heterogeneity (for example, study design, impact of dif- 
ferent LLOQ definitions) is also important. In addition, 
Ali and coUagues [8] identified the time of assessment as 
a treatment effect modifier in addition to baseline viral 
load. The studies included in this analysis were very simi- 
lar in terms of assessment times and so the exclusion of 
this variable is likely to have had a modest effect. None- 
theless, it would be interesting to replicate the analyses 
contained in this paper when controlling for these 
slight differences. Furthermore, expanding this type of 
analysis to other clinically relevant endpoints is also 
worthwhile. 



Conclusions 

The analysis showed that baseline viral load is a treat- 
ment effect modifier in CHB and that failure to cor- 
rect for this variable inflates the relative efficacy estimates 
for some interventions. Since these estimates are often 
used in economic models to generate cost-effectiveness 
estimates, failure to adjust for baseline viral load will gen- 
erate erroneous ICERs, resulting in poor use of scarce 
healthcare resources. As such, reimbursement agencies 
should therefore only use covariate-adjusted relative 
efficacy estimates in their decision making surrounding 
treatments for CHB. 
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